Adaptive and interacting Markov chain Monte Carlo (MCMC)
algorithms are a novel class of non-Markovian algorithms aimed at
improving the simulation efficiency for complicated target
distributions. In this paper, we study a general (non-Markovian)
simulation framework covering both the adaptive and interacting MCMC
algorithms. We establish a Central Limit Theorem for additive
functionals of unbounded functions under a set of verifiable
conditions, and identify the asymptotic variance. Our result extends
all the results reported so far. An application to the interacting
tempering algorithm (a simplified version of the equi-energy sampler)
is presented to support our claims.



1. Introduction. Markov chain Monte Carlo (MCMC) methods generate
samples from distributions known up to a scaling factor.

In the last decade, several non-Markovian simulation algorithms have been
proposed. In the so-called adaptive MCMC algorithm, the transition kernel
of the MCMC algorithm depends on a finite dimensional parameter which
is updated at each iteration from the past values of the chain and the
parameters. The prototypical example is the adaptive Metropolis algorithm,
introduced in Haario et al. (1999) (see Saksman and Vihola (2010) and
the references therein for recent developments). Many other examples of
adaptive MCMC algorithms are presented in the survey papers by Andrieu and
Thoms (2008); Rosenthal (2009); Atchade et al. (2011).

In the so-called interacting MCMC, several processes are simulated in
parallel, each targeting a different distribution. Each process might interact
with the whole past of its neighboring processes. A prototypical example is 
the equi-energy sampler introduced in Kou et al. (2006), where the different 
G. FORT ET AL.

processes target a tempered version of the target distribution. The
convergence of this algorithm has been considered in a series of papers by
Andrieu et al. (2007b), Andrieu et al. (2007a), Andrieu et al. (2011) and in
Fort et al. (2010a). Different variants of the interacting MCMC algorithm
were later introduced and studied in Bercu et al. (2009), Del Moral and
Doucet (2010) and Brockwell et al. (2010). These algorithms are so far
limited to specific scenarios, and the assumptions used in these papers
preclude applying their results to the examples considered in this paper.

The analysis of the convergence of these algorithms is involved. Whereas 
the basic building blocks of these simulation algorithms are Markov kernels, 
the processes generated by these techniques are no longer Markovian. Indeed, 
each individual process either interacts with its distant past, or the distant 
past of some auxiliary processes. 

The ergodicity and the consistency of additive functionals for adaptive
and interacting Markov chains have been considered in several recent
papers: see Fort et al. (2010a) and the references therein. Up to now, far
fewer works address Central Limit Theorems (CLT). Andrieu and Moulines
(2006) establish the asymptotic normality of additive functionals for a
special class of adaptive MCMC algorithms in which a finite dimensional
parameter is adapted using a stochastic approximation procedure. Some of
the theoretical limitations of Andrieu and Moulines (2006) have been
alleviated by Saksman and Vihola (2010), who established a CLT for
additive functionals of the adaptive Metropolis algorithm (with a proof
specially tailored for this algorithm). The results presented in this
contribution contain these two earlier results as special cases.

The theory for interacting MCMC algorithms is up to now quite limited,
despite the clear potential of this class of methods to sample complicated
multimodal target distributions. The law of large numbers for additive
functionals has been established in Andrieu et al. (2008) for some specific
interacting algorithm. A wider class of interacting Markov chains has been
considered in Del Moral and Doucet (2010). That paper establishes the
consistency of a form of interacting tempering algorithm and provides
non-asymptotic $L^p$-inequalities. The assumptions under which the results
are derived are restrictive and the results do not cover the interacting
MCMC algorithms considered in this paper. More recently, Fort et al. (2010a)
have established the ergodicity and law of large numbers for a wide class of
interacting MCMC, under the weakest conditions known so far.

A functional CLT was derived in Bercu et al. (2009) for a specific class of 
interacting Markov Chains but their assumptions do not cover the interactive 
A CENTRAL LIMIT THEOREM FOR IMCMC

MCMC considered in this paper (and in particular, the interacting MCMC 
algorithm). A CLT for additive functionals is established by Atchade (2010)
for the interacting tempering algorithm; the proof of the main result in that
paper, Theorem 3.3, contains a serious gap (p. 865) which seems difficult to
correct.

This paper aims at providing a theory removing the limitations mentioned
above and covering both adaptive and interacting MCMC in a common
unifying framework. The paper is organized as follows. In Section 2 we
establish CLTs for adaptive and interacting MCMC algorithms. These results
are applied in Section 3 to the interacting tempering algorithm, which is a
simplified version of the equi-energy sampler. All the proofs are postponed
to Section 4.

Notations. Let $(\mathsf{X}, \mathcal{X})$ be a general state space and $P$ be a Markov
transition kernel (see e.g. (Meyn and Tweedie, 2009, Chapter 3)). $P$ acts on
bounded functions $f$ on $\mathsf{X}$ and on $\sigma$-finite positive measures $\mu$ on $\mathcal{X}$ via

$$ Pf(x) \stackrel{\mathrm{def}}{=} \int P(x, dy) f(y) , \qquad \mu P(A) \stackrel{\mathrm{def}}{=} \int \mu(dx) P(x, A) . $$

We denote by $P^n$ the $n$-iterated transition kernel defined inductively

$$ P^n(x, A) \stackrel{\mathrm{def}}{=} \int P^{n-1}(x, dy) P(y, A) = \int P(x, dy) P^{n-1}(y, A) ; $$

where $P^0$ is the identity kernel. For a function $V : \mathsf{X} \to [1, +\infty)$, define the
$V$-norm of a function $f : \mathsf{X} \to \mathbb{R}$ by

$$ |f|_V \stackrel{\mathrm{def}}{=} \sup_{x \in \mathsf{X}} \frac{|f|(x)}{V(x)} . $$

When $V = 1$, the $V$-norm is the supremum norm denoted by $|f|_\infty$. Let $\mathcal{L}_V$
be the set of measurable functions such that $|f|_V < +\infty$. For $\mu$ a signed
measure on $(\mathsf{X}, \mathcal{X})$, we define $\|\mu\|_V$, the $V$-norm of $\mu$, as

$$ \|\mu\|_V = \sup_{f \in \mathcal{L}_V, \, |f|_V \leq 1} |\mu(f)| . $$

When $V = 1$, the $V$-norm corresponds to the total variation norm.
For two transition kernels $P_1, P_2$, define the $V$-distance as

$$ \|P_1 - P_2\|_V = \sup_{x \in \mathsf{X}} V^{-1}(x) \, \|P_1(x, \cdot) - P_2(x, \cdot)\|_V . $$

Let $(x_n)_{n \in \mathbb{N}}$ be a sequence. For $p \leq q \in \mathbb{N}^2$, $x_{p:q}$ denotes the vector $(x_p, \ldots, x_q)$.

2. Main results. Let $(\Theta, \mathcal{T})$ be a measurable space. Let $\{P_\theta, \theta \in \Theta\}$ be
a collection of Markov transition kernels on $(\mathsf{X}, \mathcal{X})$ indexed by a parameter
$\theta \in \Theta$. In the sequel, it is assumed that for any $A \in \mathcal{X}$, $(x, \theta) \mapsto P_\theta(x, A)$
is $\mathcal{X} \otimes \mathcal{T} / \mathcal{B}([0, 1])$ measurable, where $\mathcal{B}([0, 1])$ denotes the Borel $\sigma$-field. In
the sequel, $\Theta$ is not necessarily a finite-dimensional vector space. It might
be a function space or a space of measures. We consider a $\mathsf{X} \times \Theta$-valued
process $\{(X_n, \theta_n)\}_{n \in \mathbb{N}}$ on a filtered probability space $(\Omega, \mathcal{A}, \{\mathcal{F}_n, n \geq 0\}, \mathbb{P})$.
It is assumed that

A1 The process $\{(X_n, \theta_n)\}_{n \in \mathbb{N}}$ is $(\mathcal{F}_n)_{n \in \mathbb{N}}$-adapted and for any bounded
measurable function $h$,

$$ \mathbb{E}\left[ h(X_{n+1}) \mid \mathcal{F}_n \right] = P_{\theta_n} h(X_n) . $$

Assumption A1 implies that, conditionally on the past (subsumed in the
$\sigma$-algebra $\mathcal{F}_n$), the distribution of the next sample $X_{n+1}$ is governed by the
current value $X_n$ and the current parameter $\theta_n$. This assumption covers any
adaptive and interacting MCMC algorithms; see Andrieu and Thoms (2008),
Atchade et al. (2011), Fort et al. (2010a) for examples. This assumption on
the adaptation of the parameter $(\theta_n)_{n \in \mathbb{N}}$ is quite weak since it only requires
the parameter to be adapted to the filtration. In practice, it frequently occurs
that the joint process $\{(X_n, \theta_n)\}_{n \in \mathbb{N}}$ is Markovian but assumption A1 covers
more general adaptation rules.

We assume that the transition kernels $\{P_\theta, \theta \in \Theta\}$ satisfy a Lyapunov
drift inequality and smallness conditions:

A2 For all $\theta \in \Theta$, $P_\theta$ is phi-irreducible, aperiodic and there exists a
function $V : \mathsf{X} \to [1, +\infty)$, and for any $\theta \in \Theta$ there exist some constants
$b_\theta \in (1, +\infty)$, $\lambda_\theta \in (0, 1)$ such that

$$ P_\theta V \leq \lambda_\theta V + b_\theta . $$

In addition, for any $d \geq 1$ and any $\theta \in \Theta$, the level sets $\{V \leq d\}$ are
1-small for $P_\theta$.

In many examples considered so far (see Andrieu and Moulines (2006),
Saksman and Vihola (2010), Fort et al. (2010a), Andrieu et al. (2011)) this
condition is satisfied. All the results below can be established under
assumptions ensuring that the drift inequality and/or the smallness condition
are satisfied for some $m$-iterated kernel $P_\theta^m$. Note that checking assumptions
on the iterated kernel $P_\theta^m$ is prone to be difficult because the expression of
the $m$-iterated kernel is most often rather involved.

A2 implies that, for any $\theta \in \Theta$, $P_\theta$ possesses an invariant probability
distribution $\pi_\theta$ and the kernel $P_\theta$ is geometrically ergodic (Meyn and Tweedie,
2009, Chapter 15). The following lemma summarizes the properties of the
family $\{P_\theta, \theta \in \Theta\}$ used in the sequel (see e.g. Douc et al. (2004) and
references therein).

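Lemma 2.1. Assume A2. Then for any $\theta \in \Theta$, there exists a probability
distribution $\pi_\theta$ such that $\pi_\theta P_\theta = \pi_\theta$ and $\pi_\theta(V) \leq b_\theta (1 - \lambda_\theta)^{-1}$. In addition,
for any $\alpha \in (0, 1]$, the following property holds.

P[$\alpha$] For any $\theta \in \Theta$, there exist $C_\theta < \infty$ and $\rho_\theta \in (0, 1)$ such that, for
any $\gamma \in [\alpha, 1]$,

$$ \| P_\theta^n - \pi_\theta \|_{V^\gamma} \leq C_\theta \, \rho_\theta^n . $$

Set

$$ L_\theta \stackrel{\mathrm{def}}{=} C_\theta \vee (1 - \rho_\theta)^{-1} . \tag{1} $$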

It has been shown in Fort et al. (2010a) that, under appropriate
assumptions, when the sequence $(\theta_k)_{k \in \mathbb{N}}$ converges to $\theta_\star \in \Theta$ in an appropriate
sense, $n^{-1} \sum_{k=1}^n f(X_k)$ converges almost surely to $\pi_{\theta_\star}(f)$, for any function
$f$ belonging to a suitable class of functions $\mathcal{M}$.

The objective of this paper is to derive a CLT for $n^{-1/2} \sum_{k=1}^n \{ f(X_k) - \pi_{\theta_\star}(f) \}$
for functions $f$ belonging to $\mathcal{M}$. To that goal, consider the following
decomposition

$$ n^{-1/2} \sum_{k=1}^n \{ f(X_k) - \pi_{\theta_\star}(f) \} = S_n^{(1)}(f) + S_n^{(2)}(f) , $$

where $S_n^{(1)}(f)$ and $S_n^{(2)}(f)$ are given by

$$ S_n^{(1)}(f) \stackrel{\mathrm{def}}{=} n^{-1/2} \sum_{k=1}^{n} \{ f(X_k) - \pi_{\theta_{k-1}}(f) \} , \tag{2} $$

$$ S_n^{(2)}(f) \stackrel{\mathrm{def}}{=} n^{-1/2} \sum_{k=0}^{n-1} \{ \pi_{\theta_k}(f) - \pi_{\theta_\star}(f) \} . \tag{3} $$

We consider these two terms separately. For the first term, we use a
classical technique based on the Poisson decomposition; this amounts to writing
$S_n^{(1)}(f)$ as the sum of martingale differences and of a remainder term
converging to zero in probability (see Andrieu and Moulines (2006); Atchade
and Fort (2010); Fort et al. (2010a); Del Moral and Doucet (2010);
Saksman and Vihola (2010) for laws of large numbers for adaptive and interacting
MCMC). Then we apply a classical CLT for martingale difference arrays; see
for example (Hall and Heyde, 1980, Theorem 3.2).

The second term vanishes when $\pi_\theta = \pi_{\theta_\star}$ for all $\theta \in \Theta$, which is the case,
for example, for the adaptive Metropolis algorithm (Haario et al., 1999). In
scenarios where $\theta \mapsto \pi_\theta$ is a non trivial function of $\theta$, the weak convergence
of $S_n^{(2)}(f)$ relies on conditions which are quite problem specific.

The application detailed in Section 3, an elementary version of the
interacting tempering algorithm, is a situation in which $\pi_{\theta_\star}$ is known but the
expression of $\pi_\theta$, $\theta \neq \theta_\star$, is unknown, except in very simple examples. The
Wang-Landau algorithm (Wang and Landau, 2001; Liang et al., 2007) is an
example of adaptive MCMC algorithm in which $\theta \mapsto \pi_\theta$ is explicit.

The results in this paper cover the case when the expression of $\pi_\theta$ is
unknown: we rewrite $S_n^{(2)}(f)$ by using a linearization of the fluctuation
$\pi_{\theta_k}(f) - \pi_{\theta_\star}(f)$ in terms of the difference $P_{\theta_k} - P_{\theta_\star}$:

$$ \pi_{\theta_k}(f) - \pi_{\theta_\star}(f) = \pi_{\theta_\star} \left( P_{\theta_k} - P_{\theta_\star} \right) \Lambda_{\theta_\star}(f) + H(f, \theta_k) . $$

Our approach covers a much more general set-up than the one outlined in Bercu
et al. (2009).

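By A2, for any $\alpha \in (0, 1)$ and $f \in \mathcal{L}_{V^\alpha}$, the function $\sum_{n \geq 0} P_\theta^n (f - \pi_\theta(f))$
exists and is in $\mathcal{L}_{V^\alpha}$. For $\theta \in \Theta$, denote by $\Lambda_\theta : \mathcal{L}_{V^\alpha} \to \mathcal{L}_{V^\alpha}$ the
kernel which associates to any function $f \in \mathcal{L}_{V^\alpha}$ the function $\Lambda_\theta f$ given by:

$$ \Lambda_\theta f \stackrel{\mathrm{def}}{=} \sum_{n \geq 0} P_\theta^n \left( f - \pi_\theta(f) \right) . \tag{4} $$

The function $\Lambda_\theta f$ is the solution of the Poisson equation

$$ \Lambda_\theta f - P_\theta \Lambda_\theta f = f - \pi_\theta(f) . \tag{5} $$

This solution is unique up to an additive constant (see e.g. (Meyn and
Tweedie, 2009, Proposition 17.4.1)).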
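
As a numerical illustration of (4) and (5), the following minimal sketch computes the truncated series on a two-state chain and evaluates the residual of the Poisson equation. The kernel `P`, the function `f`, the truncation level `n_terms` and both helper names are hypothetical choices made for illustration only; they are not taken from the paper.

```python
def poisson_solution(P, f, n_terms=200):
    """Truncated series sum_{n < n_terms} P^n (f - pi(f)) for a 2-state kernel P."""
    a, b = P[0][1], P[1][0]                  # off-diagonal transition probabilities
    pi = (b / (a + b), a / (a + b))          # invariant distribution of P
    pi_f = pi[0] * f[0] + pi[1] * f[1]
    g = [f[0] - pi_f, f[1] - pi_f]           # current term P^n (f - pi(f)), n = 0
    sol = [0.0, 0.0]
    for _ in range(n_terms):
        sol = [sol[0] + g[0], sol[1] + g[1]]
        g = [P[0][0] * g[0] + P[0][1] * g[1],
             P[1][0] * g[0] + P[1][1] * g[1]]
    return sol, pi_f


def poisson_residual(P, f, sol, pi_f):
    """Componentwise (sol - P sol) - (f - pi(f)); zero for an exact solution."""
    P_sol = [P[0][0] * sol[0] + P[0][1] * sol[1],
             P[1][0] * sol[0] + P[1][1] * sol[1]]
    return [sol[i] - P_sol[i] - (f[i] - pi_f) for i in range(2)]


P = [[0.9, 0.1], [0.3, 0.7]]   # hypothetical geometrically ergodic kernel
f = [1.0, 0.0]
sol, pi_f = poisson_solution(P, f)
res = poisson_residual(P, f, sol, pi_f)
```

For this kernel the centred function is an eigenvector of `P` with eigenvalue 0.6, so the series is geometric and the truncation error after 200 terms is negligible.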

The convergence of $S_n^{(1)}(f)$ is addressed under the following assumptions
which are related to the regularity in the parameter $\theta \in \Theta$ of the ergodic
behavior of the kernels $\{P_\theta, \theta \in \Theta\}$.

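A3 There exist $\alpha \in (0, 1/2)$ and a subset of measurable functions $\mathcal{M}_{V^\alpha} \subseteq \mathcal{L}_{V^\alpha}$
satisfying the two following conditions

(a) for any $f \in \mathcal{M}_{V^\alpha}$,

$$ n^{-1/2} \sum_{k=1}^{n} \left| P_{\theta_k} \Lambda_{\theta_k} f - P_{\theta_{k-1}} \Lambda_{\theta_{k-1}} f \right|_{V^\alpha} V^\alpha(X_k) \stackrel{\mathbb{P}}{\longrightarrow} 0 , $$

(b) $n^{-1/(2\alpha)} \sum_{k=0}^{n-1} L_{\theta_k}^{2/\alpha} P_{\theta_k} V(X_k) \stackrel{\mathbb{P}}{\longrightarrow} 0$, where $L_\theta$ is defined by (1) for
the constants $C_\theta, \rho_\theta$ given by P[$\alpha$].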
A3-a controls the regularity in the parameter $\theta$ of the Poisson solution $\Lambda_\theta f$.
Lemma 5.1 in Appendix 5 is useful to check A3-a. It relates the regularity in
$\theta$ of the function $\theta \mapsto P_\theta \Lambda_\theta f$ to the ergodicity constants $C_\theta$ and $\rho_\theta$ introduced
in Lemma 2.1 and to the regularity in $\theta$ of the function $\theta \mapsto P_\theta$ from the
parameter space $\Theta$ to the space of Markov transition kernels equipped with
the $V$-operator norm.

A3-b is a kind of containment condition (see Roberts and Rosenthal
(2007)): when the ergodic behavior A2 is uniform in $\theta$ so that $\lambda_\theta$, $b_\theta$ and
the minorization constant of the $P_\theta$-smallness condition do not depend on
$\theta$, then the constant $L_\theta$ does not depend on $\theta$ and by A1 and the drift
inequality A2,

$$ n^{-1/(2\alpha)} \sum_{k=0}^{n-1} \mathbb{E}\left[ V(X_{k+1}) \right] \leq n^{1 - 1/(2\alpha)} \left\{ \mathbb{E}\left[ V(X_0) \right] + (1 - \lambda)^{-1} b \right\} \longrightarrow 0 . $$

Therefore, condition A3-b holds provided the ergodic constant $L_{\theta_k}$ is
controlled by a slowly-increasing function of $k$. Lemma 5.2 in Appendix 5
provides sufficient conditions to obtain upper bounds of $\theta \mapsto L_\theta$ in terms of the
constants appearing in the drift inequality A2.

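We finally introduce a condition allowing us to obtain a closed-form
expression for the asymptotic variance of $S_n^{(1)}(f)$. For $\theta \in \Theta$ and $f \in \mathcal{L}_{V^\alpha}$ define

$$ F_\theta \stackrel{\mathrm{def}}{=} P_\theta (\Lambda_\theta f)^2 - \left[ P_\theta \Lambda_\theta f \right]^2 . \tag{6} $$

A4 For any $f \in \mathcal{M}_{V^\alpha}$, $n^{-1} \sum_{k=0}^{n-1} F_{\theta_k}(X_k) \stackrel{\mathbb{P}}{\longrightarrow} \sigma^2(f)$, where $\sigma^2(f)$ is a
deterministic constant.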

Assumption A4 is typically established by using the Law of Large
Numbers (LLN) for adaptive and interacting Markov chains derived in Fort
et al. (2010a); see also Theorem 5.4 in Appendix 5. Under appropriate
regularity conditions on the Markov kernels $\{P_\theta, \theta \in \Theta\}$, it is proved that
$n^{-1} \sum_{k=0}^{n-1} \{ F_{\theta_k}(X_k) - \int \pi_{\theta_k}(dx) F_{\theta_k}(x) \}$ converges in probability to zero. The
second step consists in showing that $n^{-1} \sum_{k=0}^{n-1} \int \pi_{\theta_k}(dx) F_{\theta_k}(x)$ converges
to a (deterministic) constant $\sigma^2(f)$: when $\pi_\theta$ is not explicitly known and the
set $\mathsf{X}$ is Polish, Lemma 5.3 in Appendix 5 is useful to check this convergence.
In practice, this may introduce a restriction of the set of functions $f \in \mathcal{L}_{V^\alpha}$
for which this limit holds (see e.g. the example detailed in Section 3 where
$\mathcal{M}_{V^\alpha} \neq \mathcal{L}_{V^\alpha}$).

We can now state conditions under which $S_n^{(1)}(f)$ is asymptotically normal.
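Theorem 2.2. Assume A1 to A4. For any $f \in \mathcal{M}_{V^\alpha}$,

$$ \frac{1}{\sqrt{n}} \sum_{k=1}^{n} \left\{ f(X_k) - \pi_{\theta_{k-1}}(f) \right\} \stackrel{\mathcal{D}}{\longrightarrow} \mathcal{N}\left( 0, \sigma^2(f) \right) . $$

The proof is in Section 4.1.1. When $\pi_\theta = \pi$ for any $\theta$, Theorem 2.2 provides
sufficient conditions for a CLT for additive functionals to hold.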

When $\pi_\theta$ is a function of $\theta \in \Theta$, we now need to obtain a joint CLT
for $(S_n^{(1)}(f), S_n^{(2)}(f))$ (see (2) and (3)). To that goal, we replace A1 by
the following assumption which implies that, conditionally on the process
$(\theta_k)_{k \in \mathbb{N}}$, $(X_k)_{k \in \mathbb{N}}$ is an inhomogeneous Markov chain with transition kernels
$(P_{\theta_j}, j \geq 0)$:

A5 There exists an initial distribution $\nu$ such that for any bounded
measurable function $f : \mathsf{X}^{n+1} \to \mathbb{R}$,

$$ \mathbb{E}\left[ f(X_{0:n}) \mid \theta_{0:n} \right] = \int \cdots \int \nu(dx_0) \, f(x_{0:n}) \prod_{j=1}^{n} P_{\theta_{j-1}}(x_{j-1}, dx_j) . $$

Assumption A5 is satisfied when $\{(X_n, \theta_n)\}_{n \in \mathbb{N}}$ is an interacting MCMC
algorithm. Note that A5 implies A1.

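The first step in the proof of the joint CLT consists in linearizing the
difference $\pi_{\theta_n} - \pi_{\theta_\star}$. Under A2, $\pi_\theta(g)$ exists for any $g \in \mathcal{L}_{V^\alpha}$ and $\theta \in \Theta$ (see
Lemma 2.1), and we have

$$ \pi_\theta(g) - \pi_{\theta_\star}(g) = \pi_\theta P_\theta g - \pi_{\theta_\star} P_{\theta_\star} g = \pi_\theta \left( P_\theta - P_{\theta_\star} \right) g + \left( \pi_\theta - \pi_{\theta_\star} \right) P_{\theta_\star} g , $$

which implies that $(\pi_\theta - \pi_{\theta_\star})(I - P_{\theta_\star}) g = \pi_\theta (P_\theta - P_{\theta_\star}) g$. Let $f \in \mathcal{L}_{V^\alpha}$.
Then $\Lambda_{\theta_\star} f \in \mathcal{L}_{V^\alpha}$ and by applying the previous equality with $g = \Lambda_{\theta_\star} f$, we
have by (5)

$$ \pi_\theta(f) - \pi_{\theta_\star}(f) = \pi_\theta \left( P_\theta - P_{\theta_\star} \right) \Lambda_{\theta_\star} f . \tag{7} $$

We can iterate this decomposition, writing

$$ \pi_\theta(f) - \pi_{\theta_\star}(f) = \pi_{\theta_\star} \left( P_\theta - P_{\theta_\star} \right) \Lambda_{\theta_\star} f + \pi_\theta \left( (P_\theta - P_{\theta_\star}) \Lambda_{\theta_\star} f \right) - \pi_{\theta_\star} \left( (P_\theta - P_{\theta_\star}) \Lambda_{\theta_\star} f \right) . $$

Applying again (7), we obtain

$$ \pi_\theta(f) - \pi_{\theta_\star}(f) = \pi_{\theta_\star} \left( P_\theta - P_{\theta_\star} \right) \Lambda_{\theta_\star} f + \pi_\theta \left( P_\theta - P_{\theta_\star} \right) \Lambda_{\theta_\star} \left( P_\theta - P_{\theta_\star} \right) \Lambda_{\theta_\star} f . $$

This decomposition can be iterated further. The first term in the RHS
of the previous equation is the leading term of the error $\pi_{\theta_k} - \pi_{\theta_\star}$, whereas
the second term is a remainder. This decomposition naturally leads to the
following assumption.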
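A6 For any function $f \in \mathcal{M}_{V^\alpha}$,

(a) there exists a positive constant $\gamma^2(f)$ such that

$$ n^{-1/2} \sum_{k=1}^{n} \pi_{\theta_\star} \left( P_{\theta_k} - P_{\theta_\star} \right) \Lambda_{\theta_\star} f \stackrel{\mathcal{D}}{\longrightarrow} \mathcal{N}\left( 0, \gamma^2(f) \right) , \tag{8} $$

(b) $n^{-1/2} \sum_{k=1}^{n} \pi_{\theta_k} \left( P_{\theta_k} - P_{\theta_\star} \right) \Lambda_{\theta_\star} \left( P_{\theta_k} - P_{\theta_\star} \right) \Lambda_{\theta_\star} f \stackrel{\mathbb{P}}{\longrightarrow} 0$.

Theorem 2.3. Assume A2 to A6. For any function $f \in \mathcal{M}_{V^\alpha}$,

$$ \frac{1}{\sqrt{n}} \sum_{k=1}^{n} \left( f(X_k) - \pi_{\theta_\star}(f) \right) \stackrel{\mathcal{D}}{\longrightarrow} \mathcal{N}\left( 0, \sigma^2(f) + \gamma^2(f) \right) . $$

The proof of Theorem 2.3 is postponed to Section 4.1.2. It is worthwhile
to note that, as a consequence of A5, the variance is additive. This result
extends Bercu et al. (2009) which addresses the case when $P_\theta(x, A) = P_\theta(A)$,
i.e. the case when, conditionally on the adaptation process $(\theta_n)_{n \in \mathbb{N}}$, the
random variables $(X_n)_{n \in \mathbb{N}}$ are independent (see (Bercu et al., 2009, Eq. (1.4))).
Our result, applied in this simpler situation, yields the same asymptotic
variance.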

3. Application to Interacting Tempering algorithm. We consider
the simplified version of the equi-energy sampler (Kou et al., 2006)
introduced in Andrieu et al. (2011). This version is referred to as the
Interacting-tempering (IT) sampler. Recently, convergence of the marginals and strong
law of large numbers results have been established under general
conditions (see Fort et al. (2010a)). In this section, we derive a CLT under similar
assumptions.

Let $\{\pi^{\beta_k}, k \in \{1, \cdots, K\}\}$ be a sequence of tempered densities on $\mathsf{X}$,
where $0 < \beta_1 < \cdots < \beta_K = 1$. At the first level, a process $(Y_k)_{k \in \mathbb{N}}$ with
stationary distribution proportional to $\pi^{\beta_1}$ is run. At the second level, a process
$(X_k)_{k \in \mathbb{N}}$ with stationary distribution proportional to $\pi^{\beta_2}$ is constructed: at
each iteration the next value is obtained from a Markov kernel depending
on the occupation measure of the chain $(Y_k)_{k \in \mathbb{N}}$ up to the current time-step.
This two-stage mechanism is then repeated to design a process targeting $\pi^{\beta_k}$
by using the occupation measure of the process targeting $\pi^{\beta_{k-1}}$.

For ease of exposition, it is assumed that $(\mathsf{X}, \mathcal{X})$ is a Polish space equipped
with its Borel $\sigma$-field, and the densities are w.r.t. some $\sigma$-finite measure on
$(\mathsf{X}, \mathcal{X})$. We address the case $K = 2$ and discuss below possible extensions to
the case $K > 2$.

We start with a description of the IT (case $K = 2$). Denote by $\Theta$ the set
of the probability measures on $(\mathsf{X}, \mathcal{X})$ equipped with the Borel $\sigma$-field
$\mathcal{T}$ associated to the topology of weak convergence. Let $P$ be a transition
kernel on $(\mathsf{X}, \mathcal{X})$ with unique invariant distribution $\pi$ (typically, $P$ is chosen
to be a Metropolis-Hastings kernel). Denote by $\epsilon \in (0, 1)$ the probability of
interaction. Let $(Y_k)_{k \in \mathbb{N}}$ be a discrete-time (possibly non-stationary) process
and denote by $\theta_n$ the empirical probability measure:

$$ \theta_n \stackrel{\mathrm{def}}{=} \frac{1}{n} \sum_{k=1}^{n} \delta_{Y_k} . \tag{9} $$

Choose $X_0 \sim \nu$. At the $n$-th iteration of the algorithm, two actions may be
taken:

1. with probability $(1 - \epsilon)$, the state $X_{n+1}$ is sampled from the Markov
kernel $P(X_n, \cdot)$,

2. with probability $\epsilon$, a tentative state $Z_{n+1}$ is drawn uniformly from the
past of the auxiliary process $\{Y_k, k \leq n\}$. This move is accepted with
probability $r(X_n, Z_{n+1})$, where the acceptance ratio $r$ is given by

$$ r(x, z) \stackrel{\mathrm{def}}{=} 1 \wedge \frac{\pi(z) \, \pi^{1-\beta}(x)}{\pi^{1-\beta}(z) \, \pi(x)} = 1 \wedge \frac{\pi^{\beta}(z)}{\pi^{\beta}(x)} . \tag{10} $$

Define the family of Markov transition kernels $\{P_\theta, \theta \in \Theta\}$ by

$$ P_\theta(x, A) \stackrel{\mathrm{def}}{=} (1 - \epsilon) P(x, A) + \epsilon \left( \int_A r(x, y) \, \theta(dy) + \mathbb{1}_A(x) \int \{ 1 - r(x, y) \} \, \theta(dy) \right) . \tag{11} $$

Then, the above algorithmic description implies that the bivariate process
$\{(X_n, \theta_n)\}_{n \in \mathbb{N}}$ is such that for any bounded function $h$ on $\mathsf{X}^{n+1}$

$$ \mathbb{E}\left[ h(X_{0:n}) \mid \theta_{0:n} \right] = \int \nu(dx_0) P_{\theta_0}(x_0, dx_1) \cdots P_{\theta_{n-1}}(x_{n-1}, dx_n) \, h(x_{0:n}) . $$
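
The two-action description above can be sketched in a few lines. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the target is taken to be a standard normal density, $P$ is a symmetric random walk Metropolis kernel, and the auxiliary process is drawn i.i.d. from the density proportional to $\pi^{1-\beta}$ (which is $\mathcal{N}(0, 1/(1-\beta))$ for this target). The names `log_pi`, `rw_metropolis`, `it_step`, `run_it` and all default parameter values are hypothetical.

```python
import math
import random


def log_pi(x):
    """Log of an unnormalised standard normal target (illustrative choice)."""
    return -0.5 * x * x


def rw_metropolis(x, rng, step=1.0):
    """Symmetric random walk Metropolis kernel P with invariant density pi."""
    prop = x + rng.gauss(0.0, step)
    log_a = log_pi(prop) - log_pi(x)
    if log_a >= 0.0 or rng.random() < math.exp(log_a):
        return prop
    return x


def it_step(x, past_y, beta, eps, rng):
    """One draw from P_theta(x, .), theta being the empirical measure of past_y."""
    if rng.random() >= eps:
        return rw_metropolis(x, rng)            # action 1: move with P
    z = rng.choice(past_y)                      # action 2: uniform draw from the past of Y
    log_r = beta * (log_pi(z) - log_pi(x))      # log of (pi(z) / pi(x))^beta
    if log_r >= 0.0 or rng.random() < math.exp(log_r):
        return z
    return x


def run_it(n_iter, beta=0.5, eps=0.3, seed=0):
    """Simulate the second-level chain; the auxiliary draws are i.i.d. here."""
    rng = random.Random(seed)
    sd_y = 1.0 / math.sqrt(1.0 - beta)          # pi^(1-beta) is N(0, 1/(1-beta)) here
    past_y = [rng.gauss(0.0, sd_y)]             # Y_1
    x, xs = 0.0, []
    for _ in range(n_iter):
        x = it_step(x, past_y, beta, eps, rng)
        xs.append(x)
        past_y.append(rng.gauss(0.0, sd_y))     # next auxiliary draw
    return xs
```

Working with log-densities avoids underflow in the acceptance ratio; calling `run_it(20000)` returns a path whose empirical mean and variance can be compared with those of the target.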

We apply the results of Section 2 in order to prove that the IT process
$(X_k)_{k \in \mathbb{N}}$ satisfies a CLT. To that goal, it is assumed that the target density
$\pi$ and the transition kernel $P$ satisfy the following conditions:

I1 $\pi$ is a continuous positive density on $\mathsf{X}$ and $|\pi|_\infty < +\infty$.

I2 (a) $P$ is a phi-irreducible aperiodic Feller transition kernel on $(\mathsf{X}, \mathcal{X})$
such that $\pi P = \pi$.

(b) There exist $\tau \in (0, 1)$, $\lambda \in (0, 1)$ and $b < +\infty$ such that

$$ PV \leq \lambda V + b \quad \text{with} \quad V(x) \stackrel{\mathrm{def}}{=} \left( \pi(x) / |\pi|_\infty \right)^{-\tau} . \tag{12} $$

(c) For any $p \in (0, |\pi|_\infty)$, the sets $\{\pi \geq p\}$ are 1-small (w.r.t. the
transition kernel $P$).

(d) For any $\gamma \in (0, 1/2)$ and any equicontinuous set of functions
$\mathcal{F} \subseteq \mathcal{L}_{V^\gamma}$, the set of functions $\{Ph : h \in \mathcal{F}, |h|_{V^\gamma} \leq 1\}$ is
equicontinuous.

From the expression of the acceptance ratio $r$ (see Eq. (10)) and the
assumption I2-a, it holds

$$ \pi P_{\theta_\star} = \pi , $$

where $\theta_\star \propto \pi^{1-\beta}$. Therefore, when $\theta_n$ converges to $\theta_\star$, it is expected that
$(X_k)_{k \in \mathbb{N}}$ behaves asymptotically as $\pi$; see Fort et al. (2010a).

Drift conditions for the symmetric random walk Metropolis (SRWM)
algorithm are discussed in Roberts and Tweedie (1996), Jarner and Hansen
(2000) and Saksman and Vihola (2010). Under conditions which imply that
the target density $\pi$ is super-exponential in the tails and has regular
contours, Jarner and Hansen (2000) and Saksman and Vihola (2010) show that
any function proportional to $\pi^{-s}$ with $s \in (0, 1)$ satisfies a Foster-Lyapunov
drift inequality (Jarner and Hansen, 2000, Theorems 4.1 and 4.3). Under this
condition, I2-b is satisfied with any $\tau$ in the interval $(0, 1)$. Assumption I2-d
holds for the SRWM kernel under weak conditions on the symmetric proposal
distribution as shown by the following lemma. The proof is in Section 4.2.1.

Lemma 3.1. Assume I1. Let $P$ be a Metropolis kernel with invariant
distribution $\pi$ and a symmetric proposal distribution $q : \mathsf{X} \times \mathsf{X} \to \mathbb{R}_+$ such
that $\sup_{(x,y) \in \mathsf{X}^2} q(x, y) < +\infty$ and the function $x \mapsto q(x, \cdot)$ is continuous
from $(\mathsf{X}, |\cdot|)$ to the set of probability densities equipped with the total variation
norm. Then $P$ satisfies I2-d with any function $V \propto \pi^{-\tau}$, $\tau \in [0, 1)$, such
that $\pi(V) < +\infty$.

For a measurable function $f : \mathsf{X} \to \mathbb{R}$ such that $\theta_\star(|f|) < +\infty$, define the
following sequence of random processes on $[0, 1]$:

$$ t \mapsto S_n(f; t) = n^{-1/2} \sum_{j=1}^{\lfloor nt \rfloor} \left\{ f(Y_j) - \theta_\star(f) \right\} . \tag{13} $$

It is assumed that the auxiliary process $\{Y_n, n \geq 0\}$ converges to the
probability distribution $\theta_\star$ in the following sense:

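I3 (a) $\theta_\star(V) < +\infty$ and $\sup_n \mathbb{E}\left[ V(Y_n) \right] < +\infty$.

(b) There exists a space $\mathcal{N}$ of real-valued measurable functions
defined on $\mathsf{X}$ such that $V \in \mathcal{N}$ and for any function $f \in \mathcal{N}$,
$\theta_n(f) \stackrel{\text{a.s.}}{\longrightarrow} \theta_\star(f)$.

(c) For any function $f \in \mathcal{N}$, the sequence of processes $(S_n(f, t), n \geq 1, t \in [0, 1])$
converges in distribution to $(\gamma(f) B(t), t \in [0, 1])$,
where $\gamma(f)$ is a non-negative constant and $(B(t) : t \in [0, 1])$ is a
standard Brownian motion.

(d) For any $\alpha \in (0, 1/2)$, there exist constants $\varrho_0$ and $\varrho_1$ such that,
for any integers $n, k \geq 1$, for any measurable function $h : \mathsf{X}^k \to \mathbb{R}$
satisfying $|h(y_1, \ldots, y_k)| \leq \sum_{j=1}^{k} V^\alpha(y_j)$,

$$ \mathbb{E}\left[ \left| \int \prod_{j=1}^{k} \left[ \theta_n(dy_j) - \theta_\star(dy_j) \right] h(y_1, \ldots, y_k) \right|^2 \right] \leq A_k \, n^{-k} $$

with $\limsup_k \ln A_k / (k \ln k) < \infty$.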

I3 is satisfied when $(Y_k)_{k \in \mathbb{N}}$ is i.i.d. with distribution $\theta_\star$ such that $\theta_\star(V) < +\infty$.
In that case, I3-b to I3-c hold for any measurable function $f$ such that
$\theta_\star(|f|^2) < +\infty$. I3-d is satisfied using (Serfling, 1980, Lemma A, p. 190).

I3 is also satisfied when $(Y_k)_{k \in \mathbb{N}}$ is an asymptotically stationary Markov
chain with transition kernel $Q$. In that case, I3-a to I3-c are satisfied for any
measurable function $f$ such that $\theta_\star\left( |f \, [(I - Q)^{-1} f]| \right) < +\infty$ (see e.g. (Meyn
and Tweedie, 2009, Chapter 17)). Condition I3-d for a (non-stationary)
geometrically ergodic Markov chain is established in the supplementary
paper (Fort et al., 2011).

The following proposition shows that under I1 and I2, condition A2 holds
with the drift function $V$ given by I2-b. It also provides a control of the
ergodicity constants $C_\theta$, $\rho_\theta$ in Lemma 2.1. The proof is a direct consequence
of (Fort et al., 2010a, Proposition 3.1, Corollary 3.2), Lemmas 2.1 and 5.2,
and is omitted.

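Proposition 3.2. Assume I1 and I2a-b-c. For any $\theta \in \Theta$, $P_\theta$ is
phi-irreducible, aperiodic. In addition, there exist $\lambda \in (0, 1)$ and $b < +\infty$ such
that, for any $\theta \in \Theta$,

$$ P_\theta V(x) \leq \lambda V(x) + b \, \theta(V) , \quad \text{for all } x \in \mathsf{X} . \tag{14} $$

The property P[$\alpha$] holds for any $\alpha \in (0, 1/2)$, and there exists $C$ such that
for any $\theta \in \Theta$, $L_\theta \leq C \theta(V)$.

Assume in addition I3a and $\mathbb{E}[V(X_0)] < +\infty$. Then, $\sup_{n \geq 0} \mathbb{E}\left[ V(X_n) \right] < +\infty$.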
The next step is to check assumptions A3 and A4.

Proposition 3.3. Assume I1, I2, I3a-b and $\mathbb{E}[V(X_0)] < +\infty$. For any
$\alpha \in (0, 1/2)$, let $\mathcal{M}_{V^\alpha}$ be the set of continuous functions belonging to $\mathcal{L}_{V^\alpha} \cap \mathcal{N}$.
Then, for any $\alpha \in (0, 1/2)$, the conditions A3 and A4 hold with

$$ \sigma^2(f) \stackrel{\mathrm{def}}{=} \int \pi_{\theta_\star}(dx) \, F_{\theta_\star}(x) , \tag{15} $$

where $F_\theta$ is given by (6).

The proof is postponed to Appendix 4.2.2. We can now apply Theorem 2.3
and prove a CLT for the 2-level IT.

Theorem 3.4. Assume I1, I2, I3 and $\mathbb{E}[V(X_0)] < +\infty$. Then, for any
$\alpha \in (0, 1/2)$ and any continuous function $f \in \mathcal{L}_{V^\alpha} \cap \mathcal{N}$,

$$ \frac{1}{\sqrt{n}} \sum_{k=1}^{n} \left( f(X_k) - \pi_{\theta_\star}(f) \right) \stackrel{\mathcal{D}}{\longrightarrow} \mathcal{N}\left( 0, \sigma^2(f) + 2 \gamma^2(f) \right) , $$

where $\sigma^2(f)$ and $\gamma^2(f)$ are given by (15) and I3-c.

The proof is postponed to Appendix 4.2.3.

The above discussion could be repeated in order to prove by induction a
CLT for the $K$-level IT when $K > 2$ (see Fort et al. (2010a) for a similar
approach in the proof of the ergodicity and the LLN for IT). Nevertheless,
the main difficulty is to iterate the control of the $L^2$-moment for the
$V$-statistics (see I3-d) when $(Y_k)_{k \in \mathbb{N}}$ is not a Markov chain or, more generally,
a process satisfying some mixing conditions. A similar difficulty has been
reported in Andrieu et al. (2011).

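4. Proofs. Denote by $D_V(\theta, \theta')$ the $V$-distance of the kernels $P_\theta$ and
$P_{\theta'}$:

$$ D_V(\theta, \theta') \stackrel{\mathrm{def}}{=} \| P_\theta - P_{\theta'} \|_V . \tag{16} $$

Note that under A2, for any $\alpha \in (0, 1]$, any $f \in \mathcal{L}_{V^\alpha}$ and any $\theta \in \Theta$,

$$ |\Lambda_\theta f|_{V^\alpha} \leq |f|_{V^\alpha} L_\theta^2 \tag{17} $$

where $L_\theta$ is defined by (1).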

4.1. Proofs of the results in Section 2. 
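4.1.1. Proof of Theorem 2.2. Let $f \in \mathcal{M}_{V^\alpha}$. Eq. (5) yields
$S_n^{(1)}(f) = \Xi_n(f) + R_n^{(1)}(f) + R_n^{(2)}(f)$ with

$$ \Xi_n(f) \stackrel{\mathrm{def}}{=} \frac{1}{\sqrt{n}} \sum_{k=1}^{n} \left\{ \Lambda_{\theta_{k-1}} f(X_k) - P_{\theta_{k-1}} \Lambda_{\theta_{k-1}} f(X_{k-1}) \right\} , $$

$$ R_n^{(1)}(f) \stackrel{\mathrm{def}}{=} n^{-1/2} \sum_{k=1}^{n} \left\{ P_{\theta_k} \Lambda_{\theta_k} f(X_k) - P_{\theta_{k-1}} \Lambda_{\theta_{k-1}} f(X_k) \right\} , $$

$$ R_n^{(2)}(f) \stackrel{\mathrm{def}}{=} n^{-1/2} P_{\theta_0} \Lambda_{\theta_0} f(X_0) - n^{-1/2} P_{\theta_n} \Lambda_{\theta_n} f(X_n) . $$

We first show that the two remainder terms $R_n^{(1)}(f)$ and $R_n^{(2)}(f)$ converge
to zero in probability. We have

$$ \left| P_\theta \Lambda_\theta f(x) - P_{\theta'} \Lambda_{\theta'} f(x) \right| \leq \left| P_\theta \Lambda_\theta f - P_{\theta'} \Lambda_{\theta'} f \right|_{V^\alpha} V^\alpha(x) . $$

Assumption A3-a implies that $R_n^{(1)}(f)$ converges to zero in probability. The
drift inequality A2 combined with Jensen's inequality implies $P_\theta V^\alpha \leq \lambda_\theta^\alpha V^\alpha + b_\theta^\alpha$.
By (17) and this inequality,

$$ \left| P_\theta \Lambda_\theta f(x) \right| \leq |f|_{V^\alpha} L_\theta^2 \, P_\theta V^\alpha(x) \leq |f|_{V^\alpha} L_\theta^2 \left( V^\alpha(x) + b_\theta^\alpha \right) . $$

Then, $P_{\theta_0} \Lambda_{\theta_0} f(X_0)$ is finite w.p.1 and $n^{-1/2} P_{\theta_0} \Lambda_{\theta_0} f(X_0) \stackrel{\text{a.s.}}{\longrightarrow} 0$. By A3-b
and (17), $n^{-1/2} P_{\theta_n} \Lambda_{\theta_n} f(X_n) \stackrel{\mathbb{P}}{\longrightarrow} 0$. Hence, $R_n^{(2)}(f) \stackrel{\mathbb{P}}{\longrightarrow} 0$.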

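We now consider $\Xi_n(f)$. Set $D_k(f) \stackrel{\mathrm{def}}{=} \Lambda_{\theta_{k-1}} f(X_k) - P_{\theta_{k-1}} \Lambda_{\theta_{k-1}} f(X_{k-1})$.
Observe that under A1, $D_k(f)$ is a martingale increment w.r.t. the
filtration $\{\mathcal{F}_k, k \geq 0\}$. The limiting distribution for $\Xi_n(f)$ follows from the martingale
CLT (see e.g. (Hall and Heyde, 1980, Corollary 3.1)). We check the
conditional Lindeberg condition. Let $\epsilon > 0$. Under A2, we have by (17)

$$ |D_k(f)| \leq |f|_{V^\alpha} L_{\theta_{k-1}}^2 \left\{ V^\alpha(X_k) + P_{\theta_{k-1}} V^\alpha(X_{k-1}) \right\} . $$

Set $\tau \stackrel{\mathrm{def}}{=} 1/\alpha - 2 > 0$. Then

$$ \frac{1}{n} \sum_{k=1}^{n} \mathbb{E}\left[ D_k^2(f) \mathbb{1}_{|D_k(f)| \geq \epsilon \sqrt{n}} \mid \mathcal{F}_{k-1} \right] \leq \left( \frac{1}{\epsilon \sqrt{n}} \right)^{\tau} \frac{1}{n} \sum_{k=1}^{n} \mathbb{E}\left[ |D_k(f)|^{2+\tau} \mid \mathcal{F}_{k-1} \right] $$

$$ \leq 2^{2+\tau} |f|_{V^\alpha}^{2+\tau} \left( \frac{1}{\epsilon \sqrt{n}} \right)^{\tau} \frac{1}{n} \sum_{k=1}^{n} L_{\theta_{k-1}}^{2/\alpha} P_{\theta_{k-1}} V(X_{k-1}) . $$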
Under A3-b, the RHS converges to zero in probability thus concluding the
proof of the conditional Lindeberg condition. For the limiting variance
condition, observe that

$$ \frac{1}{n} \sum_{k=1}^{n} \mathbb{E}\left[ D_k^2(f) \mid \mathcal{F}_{k-1} \right] = \frac{1}{n} \sum_{k=0}^{n-1} F_{\theta_k}(X_k) , $$

where $F_\theta$ is given by (6) and, under A4, $n^{-1} \sum_{k=1}^{n} \mathbb{E}\left[ D_k^2(f) \mid \mathcal{F}_{k-1} \right] \stackrel{\mathbb{P}}{\longrightarrow} \sigma^2(f)$.
This concludes the proof.



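4.1.2. Proof of Theorem 2.3. We start by establishing a joint CLT for
$(S_n^{(1)}(f), S_n^{(2)}(f))$, where $S_n^{(1)}(f)$ and $S_n^{(2)}(f)$ are defined in (2) and (3),
respectively. Similar to the proof of Theorem 2.2, we write $S_n^{(1)}(f) = \Xi_n(f) + R_n^{(1)}(f) + R_n^{(2)}(f)$
and prove that $R_n^{(1)}(f) + R_n^{(2)}(f) \stackrel{\mathbb{P}}{\longrightarrow} 0$. We thus consider
the convergence of $\Xi_n(f) + S_n^{(2)}(f)$. Set $\mathcal{F}_n^\theta \stackrel{\mathrm{def}}{=} \sigma(\theta_k, k \leq n)$. Under A5,

$$ \mathbb{E}\left[ e^{i (u_1 \Xi_n(f) + u_2 S_n^{(2)}(f))} \right] = \mathbb{E}\left[ \mathbb{E}\left[ e^{i u_1 \Xi_n(f)} \mid \mathcal{F}_n^\theta \right] e^{i u_2 S_n^{(2)}(f)} \right] . $$

Applying the conditional CLT (Douc and Moulines, 2008, Theorem A.3)
with the filtration $\mathcal{F}_{n,k} \stackrel{\mathrm{def}}{=} \sigma(Y_1, \cdots, Y_n, X_0, \cdots, X_k)$ yields:

$$ \lim_{n \to \infty} \mathbb{E}\left[ e^{i u_1 \Xi_n(f)} \mid \mathcal{F}_n^\theta \right] = e^{- u_1^2 \sigma^2(f) / 2} . \tag{18} $$

Observe that under A5, the conditions (31) and (32) in Douc and Moulines
(2008) can be proved following the same lines as in the proof of Theorem 2.2;
details are omitted. Therefore,

$$ \mathbb{E}\left[ e^{i (u_1 \Xi_n(f) + u_2 S_n^{(2)}(f))} \right] = \mathbb{E}\left[ \left( \mathbb{E}\left[ e^{i u_1 \Xi_n(f)} \mid \mathcal{F}_n^\theta \right] - e^{- u_1^2 \sigma^2(f) / 2} \right) e^{i u_2 S_n^{(2)}(f)} \right] + e^{- u_1^2 \sigma^2(f) / 2} \, \mathbb{E}\left[ e^{i u_2 S_n^{(2)}(f)} \right] . $$

By (18), the first term in the RHS of the previous equation converges to
zero. Under A6, $\lim_{n \to \infty} \mathbb{E}\left[ e^{i u_2 S_n^{(2)}(f)} \right] = e^{- u_2^2 \gamma^2(f) / 2}$ and this concludes the
proof.

4.2. Proofs of Section 3.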

4.2. Proofs of Section 3. 
4.2.1. Proof of Lemma 3.1. Let $\gamma \in (0, 1/2)$ and $\mathcal{F}$ be an equicontinuous
set of functions in $\mathcal{L}_{V^\gamma}$. Let $h \in \mathcal{F}$, $|h|_{V^\gamma} \leq 1$. By construction, the
transition kernel of a symmetric random walk Metropolis with proposal transition
density $q(x, \cdot)$ and target density $\pi$ may be expressed as

$$ Ph(x) = \int r(x, y) h(y) q(x, y) \, dy + h(x) \int \{ 1 - r(x, y) \} q(x, y) \, dy , $$

where $r(x, y) \stackrel{\mathrm{def}}{=} 1 \wedge (\pi(y) / \pi(x))$ is the acceptance ratio. Therefore, the
difference $Ph(x) - Ph(x')$ may be bounded by

$$ |Ph(x) - Ph(x')| \leq 2 |h(x) - h(x')| + \int |h(y) - h(x')| \, |r(x, y) - r(x', y)| \, q(x, y) \, dy + \left| \int (h(y) - h(x')) \, r(x', y) \, (q(x, y) - q(x', y)) \, dy \right| . $$

Since $|r(x, y) - r(x', y)| \leq \pi(y) |\pi^{-1}(x) - \pi^{-1}(x')|$,

$$ \int |h(y) - h(x')| \, |r(x, y) - r(x', y)| \, q(x, y) \, dy \leq |\pi^{-1}(x) - \pi^{-1}(x')| \int |h(y) - h(x')| \, \pi(y) \, q(x, y) \, dy $$

$$ \leq \left( \sup_{(x,y) \in \mathsf{X}^2} q(x, y) \right) |\pi^{-1}(x) - \pi^{-1}(x')| \left\{ \pi(V^\gamma) + V^\gamma(x') \right\} . $$

In addition,

$$ \left| \int (h(y) - h(x')) \, r(x', y) \, (q(x, y) - q(x', y)) \, dy \right| \leq \left| \int_{\{y : \pi(y) \leq \pi(x')\}} (h(y) - h(x')) \frac{\pi(y)}{\pi(x')} (q(x, y) - q(x', y)) \, dy \right| + \left| \int_{\{y : \pi(y) > \pi(x')\}} (h(y) - h(x')) (q(x, y) - q(x', y)) \, dy \right| $$

$$ \leq 4 \pi^{-1}(x') \, \| q(x, \cdot) - q(x', \cdot) \|_{\mathrm{TV}} \, \sup_{y \in \mathsf{X}} |h(y) \pi(y)| . $$

Since $V \propto \pi^{-\tau}$ and $\tau \in (0, 1)$, $\sup_{\mathsf{X}} |h| \pi < +\infty$ uniformly in $h$ under I1. Therefore, there exists
a constant $C$ such that for any $h \in \{h \in \mathcal{F}, |h|_{V^\gamma} \leq 1\}$ and any $x, x' \in \mathsf{X}$,

$$ |Ph(x) - Ph(x')| \leq 2 |h(x) - h(x')| + C \left( |\pi^{-1}(x) - \pi^{-1}(x')| + \| q(x, \cdot) - q(x', \cdot) \|_{\mathrm{TV}} \right) \left\{ V^\gamma(x') + \pi^{-1}(x') \right\} , $$

thus concluding the proof.
4.2.2. Proof of Proposition 3.3. The proof is prefaced by several lemmas.

Lemma 4.1. Let $P_\theta$ be the transition kernel given by (11). For any $(\theta, \theta') \in \Theta^2$ and any $\alpha \in (0,1]$, we have $D_{V^\alpha}(\theta, \theta') \le 2 \|\theta - \theta'\|_{V^\alpha}$. For any positive integer $n$,

$$D_{V^\alpha}(\theta_n, \theta_{n-1}) \le \frac{2}{n}\, \theta_{n-1}(V^\alpha) + \frac{2}{n}\, V^\alpha(Y_n) . \tag{19}$$

Proof. For any / G Cy a and any x G X, 



\P e (x,f)-P e ,(x,f)\=e 



r(x,y)[f(y) - f(x)][9(dy) - 6'(dy)] 



<e\\9- 9'\\ va \r(x, •) [/(•) - f(x)]\ va <2\\0- 9'\\ ya \f\ ya , 

which proves the first assertion. Inequality (19) follows from (9) and the 
obvious identity 



$$\theta_n(f) - \theta_{n-1}(f) = -\frac{1}{n(n-1)} \sum_{k=1}^{n-1} f(Y_k) + \frac{1}{n} f(Y_n) = \frac{1}{n} \left[ f(Y_n) - \theta_{n-1}(f) \right] .$$

□
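The identity used for (19) is the usual recursion for a running average. The sketch below checks it numerically, assuming that $\theta_n(f)$ denotes the empirical mean $n^{-1} \sum_{k=1}^n f(Y_k)$; the sample values are hypothetical.

```python
def update_mean(prev_mean, new_value, n):
    """Recursion theta_n(f) = theta_{n-1}(f) + (1/n) [f(Y_n) - theta_{n-1}(f)]."""
    return prev_mean + (new_value - prev_mean) / n


# Hypothetical values f(Y_1), ..., f(Y_5).
values = [2.0, -1.0, 4.5, 0.25, 3.0]

recursive_mean = 0.0
for n, value in enumerate(values, start=1):
    recursive_mean = update_mean(recursive_mean, value, n)

batch_mean = sum(values) / len(values)

# Middle expression of the identity for the last step (n = 5):
# -(1 / (n (n - 1))) * sum_{k < n} f(Y_k) + (1 / n) f(Y_n)
n = len(values)
prev_mean = sum(values[:-1]) / (n - 1)
increment_sum_form = -sum(values[:-1]) / (n * (n - 1)) + values[-1] / n
increment_mean_form = (values[-1] - prev_mean) / n
```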



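Lemma 4.2. Let $\alpha \in (0,1)$. Assume I1, I2a-b-c, I3a-b, and $\mathbb{E}[V(X_0)] < +\infty$. Then for any $\gamma, \gamma' \in (0,1)$ and any $\delta > \gamma$,

$$n^{-\delta} \sum_{k=1}^{n} D_{V^\gamma}(\theta_k, \theta_{k-1})\, V^{\gamma'}(X_k) \stackrel{P}{\longrightarrow} 0 .$$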
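Proof. By Lemma 4.1, we have

$$n^{-\delta} \sum_{k=1}^{n} D_{V^\gamma}(\theta_k, \theta_{k-1})\, V^{\gamma'}(X_k) \le 2 n^{-\delta} \sum_{k=1}^{n} k^{-1} \left\{ \theta_{k-1}(V^\gamma) + V^\gamma(Y_k) \right\} V^{\gamma'}(X_k) .$$

By I3-b, $\theta_k(V) \to \theta_\star(V)$ a.s., thus implying that $\limsup_k \{ \theta_k(V^\gamma) + k^{-\gamma} V^\gamma(Y_k) \} < \infty$, $\mathbb{P}$-a.s. Therefore, the result holds if

$$\limsup_{n} n^{-\delta} \sum_{k=1}^{n} k^{\gamma - 1}\, \mathbb{E}\left[ V^{\gamma'}(X_k) \right] = 0 .$$

Under the stated assumptions, Proposition 3.2 implies that $\sup_k \mathbb{E}[V(X_k)] < +\infty$ and this concludes the proof. □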
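The last step of this proof rests on the elementary fact that $n^{-\delta} \sum_{k=1}^n k^{\gamma-1} \le n^{\gamma-\delta}/\gamma$, which tends to zero when $\delta > \gamma$. A small numerical sketch of this bound follows; the values of `gamma` and `delta` are hypothetical and chosen only for illustration.

```python
def scaled_sum(n, gamma, delta):
    """Return n^(-delta) * sum_{k=1}^n k^(gamma - 1)."""
    return n ** (-delta) * sum(k ** (gamma - 1.0) for k in range(1, n + 1))


def integral_bound(n, gamma, delta):
    """Upper bound n^(gamma - delta) / gamma, obtained by comparing the sum with an integral."""
    return n ** (gamma - delta) / gamma


gamma, delta = 0.3, 0.5          # hypothetical exponents with delta > gamma
sizes = (10, 1000, 100000)
values = [scaled_sum(n, gamma, delta) for n in sizes]
bounds = [integral_bound(n, gamma, delta) for n in sizes]
```

Both `values` and `bounds` decrease as `n` grows, at the rate $n^{\gamma - \delta}$.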
Lemma 4.3. For any $\theta \in \Theta$, any measurable function $f : X \to \mathbb{R}$ in $\mathcal{L}_{V^\alpha}$ and any $x, x' \in X$ such that $\pi(x) \le \pi(x')$

|P e /(cc) - Pef(x')\ < \Pf(x) - Pf(x')\ + |/(x) - /(x')| 

+ supvr \f\ va ir- p (x)-ir- fi (x') {V a (x') + 9{V a )) . 
x 

Proof. The proof is adapted from (Fort et al., 2010a, Lemma 5.1.); it is 
omitted for brevity. □ 

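Proof of Proposition 3.3. Let $\alpha \in (0, 1/2)$. By Proposition 3.2, A2 and P[$\alpha$] hold. By I3-b,

$$\limsup_{n} L_{\theta_n} < +\infty , \quad \mathbb{P}\text{-a.s.} \tag{20}$$

where $L_\theta$ is given by (1) with $C_\theta, \rho_\theta$ defined by P[$\alpha$].
We first check A3-a. Let $f \in \mathcal{N} \cap \mathcal{L}_{V^\alpha}$. By Lemma 5.1,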

\Pe k ^eJ-Pe k _Ae k -j\ va <$ {Le k V Le^f ^ k {V a )D v ^6 k ,9 k ^) \f\ ya . 
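By Lemma 2.1, Proposition 3.2 and Assumptions I1, I2 and I3-b,

$$\limsup_{n \to \infty} \pi_{\theta_n}(V) \le b (1 - \lambda)^{-1} \limsup_{n \to \infty} \theta_n(V) < \infty , \quad \mathbb{P}\text{-a.s.} \tag{21}$$

Therefore, by (20) and (21), it suffices to prove that

$$n^{-1/2} \sum_{k=1}^{n} D_{V^\alpha}(\theta_k, \theta_{k-1})\, V^\alpha(X_k) \stackrel{P}{\longrightarrow} 0 ,$$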

which follows from Lemma 4.2. We now check A3-b. By Proposition 3.2, it 
holds 

n n 

„-!/(*,) j- L ^P ek V(X k ) < n -v(*.) £ /V a [y(A fe ) + W*(V)' 

fe=l fc=l 

Under the stated assumptions, $\limsup_n [\theta_n(V) + L_{\theta_n}] < +\infty$ w.p.1 and by Proposition 3.2, $\sup_k \mathbb{E}[V(X_k)] < +\infty$. Since $2\alpha < 1$, this concludes the proof.

The proof of A4 is in two steps: it is first proved that 

(22) - J2 F 6k (X k ) - - J2 / ^ (dx)F 6k (x) A , 

fc=0 fc=0 
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and then it is established that 

(23) J Tr 6k (dx)F ek (x) ^ J ^ (dx)i^ (x) . 

In order to prove (22), we check the conditions of Theorem 5.4 in Appendix 5 
with 7 = 2a. First observe that Ag/ 2 E C V 2 a (see (17)). We check condi- 
tions (i) to (vi) of Theorem 5.4. 

(i) and (iii) follow from Proposition 3.2 and (20). 

(ii) follows from Eq. (21). 

(iv) follows from Lemma 4.2. 

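(v) Under A2, we have by (17) and Jensen's inequality

$$|F_\theta(x)| \le 2\, |f|^2_{V^\alpha}\, L_\theta^4\, P_\theta V^{2\alpha}(x) \le 2\, |f|^2_{V^\alpha}\, L_\theta^4\, |P_\theta V^{2\alpha}|_{V^{2\alpha}}\, V^{2\alpha}(x) .$$

Hence, $|F_\theta|_{V^{2\alpha}} \le 2\, |f|^2_{V^\alpha}\, L_\theta^4\, |P_\theta V^{2\alpha}|_{V^{2\alpha}}$. By I3-b, the drift inequality (14) and Jensen's inequality, $\limsup_n |P_{\theta_n} V^{2\alpha}|_{V^{2\alpha}} < +\infty$ w.p.1. (20) concludes the proof.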

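(vi) Set $F_\theta(x) = G_\theta(x) - H_\theta(x)$ where $G_\theta(x) = P_\theta [\Lambda_\theta f]^2(x)$ and $H_\theta(x) = (P_\theta \Lambda_\theta f(x))^2$, with

$$|\Lambda_\theta f|_{V^\alpha} + |\Lambda_{\theta'} f|_{V^\alpha} \le M_{\theta,\theta'} \stackrel{\mathrm{def}}{=} (L_\theta^2 + L_{\theta'}^2)\, |f|_{V^\alpha} .$$

By (20), $\limsup_{n \to \infty} M_{\theta_n, \theta_{n-1}} < \infty$, $\mathbb{P}$-a.s.
It holds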



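$$\begin{aligned}
|G_\theta(x) - G_{\theta'}(x)|
& \le \left| P_\theta\!\left(x, [\Lambda_\theta f]^2 - [\Lambda_{\theta'} f]^2\right) \right| + \left| \int \{ P_\theta(x, dy) - P_{\theta'}(x, dy) \}\, [\Lambda_{\theta'} f]^2(y) \right| \\
& \le 2 M_{\theta,\theta'}\, P_\theta\!\left(x, |\Lambda_\theta f - \Lambda_{\theta'} f|\, V^\alpha\right) + M_{\theta,\theta'}^2\, D_{V^{2\alpha}}(\theta, \theta')\, V^{2\alpha}(x) \\
& \le 2 M_{\theta,\theta'}\, |\Lambda_\theta f - \Lambda_{\theta'} f|_{V^\alpha}\, |P_\theta V^{2\alpha}|_{V^{2\alpha}}\, V^{2\alpha}(x) + M_{\theta,\theta'}^2\, D_{V^{2\alpha}}(\theta, \theta')\, V^{2\alpha}(x) .
\end{aligned}$$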
By Lemma 5.1, 

\f\yl \A e f - V/lv- < 3D va (6, 6') (L e V L e >f MV a ) . 
Since w.p.1:

$$\limsup_{n \to \infty} \left\{ \pi_{\theta_n}(V) + M_{\theta_n, \theta_{n-1}} + L_{\theta_n} + |P_{\theta_n} V^{2\alpha}|_{V^{2\alpha}} \right\} < \infty ,$$

it follows that $n^{-1} \sum_{k=1}^{n} V^{2\alpha}(X_k)\, |G_{\theta_k} - G_{\theta_{k-1}}|_{V^{2\alpha}}$ converges to zero in probability provided that

$$n^{-1} \sum_{k=1}^{n} \left[ D_{V^{2\alpha}}(\theta_k, \theta_{k-1}) + D_{V^\alpha}(\theta_k, \theta_{k-1}) \right] V^{2\alpha}(X_k) \stackrel{P}{\longrightarrow} 0 ,$$

which follows from Lemma 4.2. Similarly, it holds

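$$\begin{aligned}
|H_\theta(x) - H_{\theta'}(x)|
& \le |P_\theta \Lambda_\theta f(x) - P_{\theta'} \Lambda_{\theta'} f(x)|\; |P_\theta \Lambda_\theta f(x) + P_{\theta'} \Lambda_{\theta'} f(x)| \\
& \le M_{\theta,\theta'}\, |P_\theta \Lambda_\theta f(x) - P_{\theta'} \Lambda_{\theta'} f(x)|\, \{ P_\theta V^\alpha(x) + P_{\theta'} V^\alpha(x) \} .
\end{aligned}$$

Along the same lines as above, using Lemmas 4.2 and 5.1, we prove that $n^{-1} \sum_{k=1}^{n} V^{2\alpha}(X_k)\, |H_{\theta_k} - H_{\theta_{k-1}}|_{V^{2\alpha}}$ converges to 0 in probability.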

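We now consider the second step and prove (23). To that goal, we have to strengthen the conditions on $f$ by assuming that $f$ is continuous. For any $\theta \in \Theta$, $\int \pi_\theta(dx) F_\theta(x) = \int \pi_\theta(dx) H_\theta(x)$ with

$$H_\theta(x) \stackrel{\mathrm{def}}{=} (\Lambda_\theta f)^2(x) - (P_\theta \Lambda_\theta f)^2(x) . \tag{24}$$

We prove that there exists $\Omega_\star$ with $\mathbb{P}(\Omega_\star) = 1$ and for any $\omega \in \Omega_\star$,

(I) for any continuous bounded function $h$, $\lim_n \pi_{\theta_n(\omega)}(h) = \pi_{\theta_\star}(h)$,

(II) the set $\{H_{\theta_n(\omega)}, n \ge 0\}$ is equicontinuous,

(III) $\sup_n \pi_{\theta_n(\omega)}\left( |H_{\theta_n(\omega)}|^{1/(2\alpha)} \right) < +\infty$,

(IV) $\lim_n H_{\theta_n(\omega)}(x) = H_{\theta_\star}(x)$ for any $x \in X$,

(V) $\pi_{\theta_\star}(|H_{\theta_\star}|) < +\infty$.

The proof is then concluded by application of Lemma 5.3.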

Proof of (I). Under the conditions I1 and I2a-b-c, I3a-b and $\mathbb{E}[V(X_0)] < +\infty$, (Fort et al., 2010a, Proposition 3.3.) proves that this condition holds for any $\omega \in \Omega_1$ such that $\mathbb{P}(\Omega_1) = 1$. □

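Proof of (II). Let $C_\theta, \rho_\theta$ be given by P[$\alpha$]. For any constants $C, v > 0$ and $\rho \in (0, 1)$, set

$$\Theta_{C,\rho,v} \stackrel{\mathrm{def}}{=} \{ \theta \in \Theta \ \text{s.t.}\ C_\theta \le C,\ \rho_\theta \le \rho,\ \theta(V) \le v \} . \tag{25}$$

By (20) and I3-b,

$$\limsup_{n} C_{\theta_n} < +\infty , \quad \mathbb{P}\text{-a.s.} \qquad \limsup_{n} \rho_{\theta_n} < 1 , \quad \mathbb{P}\text{-a.s.} \tag{26}$$

and $\limsup_n \theta_n(V) < +\infty$ w.p.1. Therefore, it is sufficient to prove that the set $\{H_\theta, \theta \in \Theta_{C,\rho,v}\}$ is equicontinuous.

Let $C, v > 0$ and $\rho \in (0, 1)$ be fixed. Observe that by definition of $L_\theta$ (see (1)) and (17)

$$\sup_{\theta \in \Theta_{C,\rho,v}} |\Lambda_\theta f|_{V^\alpha} \le |f|_{V^\alpha} \left( C \vee (1-\rho)^{-1} \right)^2 . \tag{27}$$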
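For any $x, x' \in X$ and any $\theta \in \Theta_{C,\rho,v}$,

$$\begin{aligned}
|H_\theta(x) - H_\theta(x')| \le{} & |\Lambda_\theta f(x) + \Lambda_\theta f(x')|\; |\Lambda_\theta f(x) - \Lambda_\theta f(x')| \\
& + |P_\theta \Lambda_\theta f(x) + P_\theta \Lambda_\theta f(x')| \left( |\Lambda_\theta f(x) - \Lambda_\theta f(x')| + |f(x) - f(x')| \right) ,
\end{aligned}$$

where we have used $P_\theta \Lambda_\theta f(x) - P_\theta \Lambda_\theta f(x') = \Lambda_\theta f(x) - \Lambda_\theta f(x') - [f(x) - f(x')]$. By (14) and (27), for any $\theta \in \Theta_{C,\rho,v}$,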

(28) \P e A 9 f(x)\<\f\ Va (Cv(l-p)^ 1 ) 2 (y(x) + ^) . 

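Therefore, since $f$ and $V$ are continuous, it suffices to prove that the set $\{\Lambda_\theta f, \theta \in \Theta_{C,\rho,v}\}$ is equicontinuous. Lemma 4.3 and I2-d imply that the set $\{P_\theta f, \theta \in \Theta_{C,\rho,v}\}$ is equicontinuous. Repeated applications of this lemma show that for any $\ell \ge 1$, the set $\{P_\theta^\ell f, \theta \in \Theta_{C,\rho,v}\}$ is equicontinuous. By Proposition 3.2, we have for any $\theta \in \Theta_{C,\rho,v}$,

$$|\Lambda_\theta f(x) - \Lambda_\theta f(x')| \le \sum_{k=0}^{n-1} \left| P_\theta^k f(x) - P_\theta^k f(x') \right| + 2 \rho^n C (1-\rho)^{-1} \left( V(x) + V(x') \right) .$$

Then, the set $\{\Lambda_\theta f, \theta \in \Theta_{C,\rho,v}\}$ is equicontinuous. □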

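Proof of (III). By (20) and (21), there exists $\Omega_3$ such that $\mathbb{P}(\Omega_3) = 1$ and for any $\omega \in \Omega_3$, (III) holds if, for any constants $C, v > 0$ and $\rho \in (0, 1)$,

$$\sup_{\theta \in \Theta_{C,\rho,v}} |H_\theta|_{V^{2\alpha}}^{1/(2\alpha)}\, \pi_\theta(V) < +\infty ,$$

where $\Theta_{C,\rho,v}$ is defined by (25). The bound on $H_\theta$ follows from (27) and (28). By Lemma 2.1 and Proposition 3.2, $\sup_{\theta \in \Theta_{C,\rho,v}} \pi_\theta(V) \le (1 - \lambda)^{-1} b\, v$.

□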

Proof of (IV). We first prove that for any $x \in X$, $\Lambda_{\theta_n} f(x) \stackrel{a.s.}{\longrightarrow} \Lambda_{\theta_\star} f(x)$. By Proposition 3.2, for any $\ell \ge 1$,

(29) \A e J(x) - A e J(x)\ < C e J e V{x) + C e J V{x) + \n e „(f) ~ **(/)! 

+Y^\pif{x)-ptf{2 

fc=0 

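From (Fort et al., 2010a, Proposition 3.3.), $\pi_{\theta_n}(f) - \pi_{\theta_\star}(f) \stackrel{a.s.}{\longrightarrow} 0$ since $f$ is continuous. In addition, following the same lines as in the proof of (Fort et al., 2010a, Proposition 3.3, Lemma 4.4.), it holds $P_{\theta_n}^k f(x) - P_{\theta_\star}^k f(x) \stackrel{a.s.}{\longrightarrow} 0$ for any $k$. Therefore, by (26), (29) shows that $\Lambda_{\theta_n} f(x) \stackrel{a.s.}{\longrightarrow} \Lambda_{\theta_\star} f(x)$.

It remains to prove that $P_{\theta_n} \Lambda_{\theta_n} f(x) \stackrel{a.s.}{\longrightarrow} P_{\theta_\star} \Lambda_{\theta_\star} f(x)$. This is a consequence of the above discussion and the equality

$$P_{\theta_n} \Lambda_{\theta_n} f(x) - P_{\theta_\star} \Lambda_{\theta_\star} f(x) = \Lambda_{\theta_n} f(x) - \Lambda_{\theta_\star} f(x) + \pi_{\theta_n}(f) - \pi_{\theta_\star}(f) ,$$

which follows from (5).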

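Combining the two results above, for any $x \in X$, $H_{\theta_n}(x) \stackrel{a.s.}{\longrightarrow} H_{\theta_\star}(x)$ as $n \to +\infty$. Since $X$ is Polish, there exists a countable dense subset $\mathcal{D}$ of $X$ and a set $\Omega_4$ with $\mathbb{P}(\Omega_4) = 1$ such that for any $x \in \mathcal{D}$ and any $\omega \in \Omega_4$,

$$\lim_{n} H_{\theta_n(\omega)}(x) = H_{\theta_\star}(x) .$$

The proof is concluded by the inequality

$$|H_{\theta_n(\omega)}(x) - H_{\theta_\star}(x)| \le |H_{\theta_n(\omega)}(x) - H_{\theta_n(\omega)}(x')| + |H_{\theta_n(\omega)}(x') - H_{\theta_\star}(x')| + |H_{\theta_\star}(x') - H_{\theta_\star}(x)| ,$$

the continuity of $H_{\theta_\star}$ and (II). □

Proof of (V). Since $H_{\theta_\star} \in \mathcal{L}_{V^{2\alpha}}$, this is a consequence of Lemma 2.1 and Assumption I3-a. □

□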

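4.2.3. Proof of Theorem 3.4. We check the conditions of Theorem 2.3. A2 to A5 hold (see Propositions 3.2 and 3.3) and we now prove A6. We first check condition A6-a. For any function $f \in \mathcal{L}_{V^\alpha} \cap \mathcal{N}$, define

$$G_f(z) = \epsilon \iint \left( \delta_z(dz') - \theta_\star(dz') \right) \pi_{\theta_\star}(dx)\, r(x, z') \left\{ \Lambda_{\theta_\star} f(z') - \Lambda_{\theta_\star} f(x) \right\} . \tag{30}$$

Let $f \in \mathcal{L}_{V^\alpha} \cap \mathcal{N}$; note that $G_f \in \mathcal{L}_{V^\alpha}$. Recall that by Eq. (11), for any $\theta$ such that $\theta(V^\alpha) < +\infty$,

$$P_\theta f(x) - P_{\theta_\star} f(x) = \epsilon \int [\theta(dy) - \theta_\star(dy)]\, r(x,y) \left( f(y) - f(x) \right) . \tag{31}$$

Then, using (30),

$$\pi_{\theta_\star} (P_{\theta_k} - P_{\theta_\star}) \Lambda_{\theta_\star} f = \epsilon \iint \pi_{\theta_\star}(dx) \left\{ \theta_k(dz) - \theta_\star(dz) \right\} r(x, z) \left[ \Lambda_{\theta_\star} f(z) - \Lambda_{\theta_\star} f(x) \right] = \theta_k(G_f) .$$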
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Therefore, 



1 n 1 n .. k 

fc=l fc=l 1=1 



re-1 



pi "-* p{k+l)/n / i \ i 

/r^Wdi + g/^ (-- ? ) S „ W d t + - S „ (1 ) 



with S„(i) = f n" 1 ^ y^g G?;(l}). Note that 

1 n 1 1 fc 

fe=l 7 = 1 






Since Gf G -Cyc, I3-a implies that sup fc>0 E[|G/|(y"/ c )] < oo. Therefore, 

•£l r(k+l)/n fn ^ _ x 



^1 ,(fc+l)/n / n 1\ 1 p 



Using I3-c and the continuous mapping theorem (van der Vaart and Wellner, 1996, Theorem 1.3.6), we obtain

i n t, r 1 

— Y^7T0APe k -PeJAeJ^7 2 (f) / t~ l B t dt . 

V™ fc=l ^o 

Since $\int_0^1 t^{-1} B_t\, dt = -\int_0^1 \log(t)\, dB_t$, $\int_0^1 t^{-1} B_t\, dt$ is a Gaussian random variable with zero mean and variance $\int_0^1 \log^2(t)\, dt = 2$.
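The value of this variance can be checked numerically in two independent ways. The sketch below is only an illustration: it approximates $\int_0^1 \log^2(t)\, dt$ by a midpoint rule, and it also approximates the double integral $\int_0^1 \int_0^1 \min(s,t)/(st)\, ds\, dt$, which is the variance of $\int_0^1 t^{-1} B_t\, dt$ computed from the Brownian covariance $\mathbb{E}[B_s B_t] = \min(s,t)$. The grid sizes are arbitrary choices.

```python
import math


def midpoint_log_squared(num_cells):
    """Midpoint-rule approximation of the integral of log(t)^2 over (0, 1)."""
    h = 1.0 / num_cells
    return h * sum(math.log((i + 0.5) * h) ** 2 for i in range(num_cells))


def midpoint_covariance_integral(num_cells):
    """Midpoint-rule approximation of the double integral of min(s, t) / (s t) over the unit square."""
    h = 1.0 / num_cells
    points = [(i + 0.5) * h for i in range(num_cells)]
    return h * h * sum(min(s, t) / (s * t) for s in points for t in points)


variance_from_ito = midpoint_log_squared(20000)               # Ito isometry route
variance_from_covariance = midpoint_covariance_integral(300)  # Brownian covariance route
exact_value = math.gamma(3.0)  # substitution t = exp(-s) gives Gamma(3) = 2
```

Both approximations agree with the closed form up to the discretisation error of the midpoint rule.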



We now check condition A6-b. Note that 

n n 

fc=i fc=i 

where 

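$$G_\theta^{f}(x) \stackrel{\mathrm{def}}{=} (P_\theta - P_{\theta_\star})\, \Lambda_{\theta_\star}\, (P_\theta - P_{\theta_\star})\, \Lambda_{\theta_\star} f(x) . \tag{32}$$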

We write for any x G X and any 4£N, 
By Proposition 3.2, P[$\alpha$] holds and there exist $C_\theta, \rho_\theta$ such that $\|P_\theta^n - \pi_\theta\|_{V^\alpha} \le C_\theta \rho_\theta^n$. Furthermore, Lemma 5.2 and I3-b imply that $\limsup_n C_{\theta_n} < +\infty$ w.p.1 and there exists a constant $\rho \in (0, 1)$ such that $\limsup_n \rho_{\theta_n} \le \rho$, w.p.1. Set $\ell_k \stackrel{\mathrm{def}}{=} \lfloor \ell \ln k \rfloor$ with $\ell$ such that $1/2 + \ell \ln \rho < 0$. Let $x \in X$ be fixed.

By Lemma 4.4 and I3-b, there exists an almost surely finite random variable $C_1$ s.t.



1 n 
v fc=i 



<C*iy Q (x)n- 1/2 ^ 4 . 



fe=i 



Since n~V2 £» =1 /* < p -i n -i/2 ££ =1 ^ 



lnp 



0, it holds 



v fc=i 



By Lemma 4.6, there exist some positive constants C2, ft*, a such that 



E 



£{< fe -< fc K>) 



\A=1 



fe=i t=i v y 



Since lim^^A = 0, there exists &* such that for k > k+, (K+l^/k 1 / 2 < 
1/2. Then, 



n , 4 



fe* -, Rinfcl 



1 " 1 ** / a \ at -, «* 1 t™" / \ at r, n 1 

1 y^ 1 y^ / k*4 \ < _L'spI y^ / «*4 A 2^1 

^2^ J, 2^ \ k l/(2a)J ~^Z^ k 2^ \ k l/(2a)J + ^ Z^ fc 

The RHS tends to zero when n —> +00, which proves that n~ l < 2 5Zfc=i{-f(? ' 
Finally, by Lemma 4.7, there exists a constant C3 such that 



E 



;£S**«) : 



1/2 



^^)iEI 



fc=i 



0. 



thus implying that n^ 1 / 2 ££ =1 P t G k^ ~^ °' 

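Lemma 4.4. Assume I1 and I2a-b-c. Let $\alpha \in (0, 1/2)$. For any $f \in \mathcal{L}_{V^\alpha}$ and $\theta \in \Theta$,

$$G_\theta^{f}(x) = \int (\theta - \theta_\star)^{\otimes 2}(dz_{1:2})\, F^{(0)}(x, z_1, z_2) ,$$

where $G_\theta^{f}$ is defined by (32); and there exists a constant $C$ such that for any $x \in X$,

$$\left| F^{(0)}(x, z_1, z_2) \right| \le C\, |f|_{V^\alpha}\, V^{\alpha \wedge (\beta/\tau)}(x) \left( V^\alpha(z_1) + V^\alpha(z_2) \right) .$$

In addition, there exists some constant $C$ such that for any $\ell \in \mathbb{N}$, any $\theta \in \Theta$ and any $f \in \mathcal{L}_{V^\alpha}$,

$$\left| (\pi_\theta - P_\theta^\ell)\, G_\theta^{f} \right|_{V^\alpha} \le C\, |f|_{V^\alpha}\, \left\| \pi_\theta - P_\theta^\ell \right\|_{V^\alpha}\, \theta(V^\alpha) .$$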



Proof. Set $\gamma = \alpha \wedge (\beta/\tau)$. Throughout this proof, let $L_\theta$ be the constant given by P[$\gamma$]. We have



F(°Hx,z 1 ,z 2 ) d ^e 2 r(x,z 2 ) 



A 0i (z 2 ,dy)r(y,zi)(A^/(zi)-A^/(y)) 



A fli (x,dy)r(y,zi)(A e ,/(zi)-A e ,/(3/)) 



Note that |r(-,zi)|^ 7 < 1 for any z\ so that by (17), 



A 8i< (z 2 ,dy)r(y,zi)AeJ(zi) 



<Ll\f\ Va V a (z 1 )V^(z 2 ) 



In addition, since 7 — (3/t < 0, we have by definition of the acceptance ratio 
r (see (10)) 

r(x,z 2 )V^{z 2 )<V~ f {x). 



Then, there exists a constant C such that 



e 2 r(x,z 2 ) 



A 8i< (z 2 ,dy)r(y,zi)AeJ(zi) 



<C\f\ Va V a { Zl )V\x) 



Similar upper bounds can be obtained for the three remaining terms in F^ > , 
thus showing the upper bounds on F^ ^ . 
In addition, by Ppy] 



vr, - Pi) G f J(x) 



< 


TlQ - 


pi 




G{f 


ya 






ya 





V a (x) 



v° 



The proof is concluded upon noting that \G e (x)\ < C\f\ ya 9{V a ). D 
Lemma 4.5. Assume I1 and I2a-b-c. Let $\alpha \in (0, 1/2)$. There exist some constants $C$, $\kappa_\star$ and $\rho_\star \in (0,1)$ such that for any $t \ge 1$, any integers $u_1, \cdots, u_t$ and any $f \in \mathcal{L}_{V^\alpha}$,

(Pe - P 6 J {P£ - no,) ■ ■ ■ {P 9 - Po.) {PI 1 - TreJ G f e (x) 

= J ■ ■ • J (9 - ff*)® (t+2) (d^l:t +2 ) *&(*, «!,•••, Z i+2 ) 

where G e is defined in (32), and 

t t+2 

(33) |i$> t (x,si,.. • , 0t+2 )| <C|/| Va «* pF- 1% y QA ^)(x) X> Q fe) . 

i=i 

Proof. By repeated applications of Eq. (31), it can be proved that the 
functions Fu 1:t are recursively defined as follows 

(34) f| t (i,zi,- ,z t+2 ) = er(x,z i+2 )x 

J {P£(z t+2 ,dy)-P£(x,dy))Fi t - l _\(y,z u --- ,z t+1 ) , 

where i^uii = -^ an d F is given by Lemma 4.4. 

The proof of the upper bound is by induction. The property holds for 
t = 1. Assume it holds for t > 2. Set 7 = a A (/3/r); by Proposition 3.2 and 
the property P[7], there exist some constants C* and p* G (0, 1) such that 
K-^Wy-, <C B J 9it . Then, 



F£>>,z 1:t+2 ) ^CI/I^k*- 1 



Et— 1 
.7=1 u j 



4+1 



xr(x,z t+2 ) llP^-vreJL 7 ^(z 4+2 )4 \\PT - *&.\\ v ^V{x) 



<C \f\ va 4" 1 eC 6i , p^ lUj r(x,z t+2 ) {V (z t+2 ) + V (x)} . 

Since 7 < /3/r, r(x, zt^V 1 (zt+2) < ^ 7 (x) thus showing (33) with k* = 
2C\e. D 

Lemma 4.6. Assume I1, I2a-b-c and I3. Let $\alpha \in (0, 1/2)$. There exist positive constants $C, \kappa, a$ such that for any $f \in \mathcal{L}_{V^\alpha}$, any $k, \ell \ge 1$ and any $x \in X$,



E 






.}<(*) 



1/2 



<C|/| 



F a (x) 



e-i 



\ n 



t=i 



E (ink 



-l/(2o) 



«f 
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PROOF. For any g G Cy™, k,£>l and x G X, 



27 



HA X ) - HA*) 
= E E p l7' 



-*-ELi u j 



-*-E?=i«. 



t=l U\-.t&Ut 

= E E C^' =1 "(^-^)W-^) 

x...(P dk -P e J(P^-^)g(x) 

where W* = {ui^ttj G N, £*- =1 Wj <•£-*}. Fix £ G {1, ••• ,j 
Ui;t G W*. Then by Lemma 4.5, 



1} and 



< * E * =iUj (i\ - nj w - **.) • • • (^ - flo re - t«*) <(^) 



(« 



fe — <?*; 



i(t+2) 



(dz 1:t+2 ) / P, 



-*-E'=i« 



(x,dy)F® t (y,z 1 ,--- ,z t+2 ) . 



Assumptions I3-b and I3-d and Lemma 4.5 show that there exist constants 
C, k+, p* G (0, 1) such that 



(0 



k ~ c*J 



-*-E?=i«. 



^\dz V , +2 )j Pi 

C 



{x,dy)F® t (y,zi,- ■ ,z t+2 , 
Pi 



t „£i=i«i o'-t-^ifj-ya/x 



Finally, Proposition 3.2 implies that sup 7>0 
these results, we have for some constant C 



<J T/Q 



P^V 



r° 



< +oo. By combining 



U\.t&At 



t=\ 



SLl «i 



Note that X)«i. t ew t P* 3 * ' — 0- ~ P*) *• Furthermore, there exists a > 
such that At < t a *. Therefore, 

Pl h GUx)-PlGUx) 



£-1 



< Ck- 1 \f\ va V a {x) J2 (^ 1/a (l " P*)~ 1/a A:" 1 /^)) 



at 



t=l 



imsart-aap ver. 2009/08/13 file: fmpv_clt_arXiv.tex date: July 14, 2011 




This concludes the proof. □ 

Lemma 4.7. Assume I1, I2a-b-c and I3. Let $\alpha \in (0, 1/2)$ and $f \in \mathcal{L}_{V^\alpha}$. Then, there exists a constant $C$ such that for any $k, \ell \ge 1$ and any $x \in X$,



E 
Proof. We have 



p e± G e (x 



n l/2 

<C£ a \f\ va k- l V a (x) 



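$$P_{\theta_\star}^\ell G_{\theta_k}^{f}(x) = \iint (\theta_k - \theta_\star)^{\otimes 2}(dz_{1:2})\, H_\ell(x, z_1, z_2) ,$$

with $H_\ell(x, z_1, z_2) = P_{\theta_\star}^\ell\left(x, F^{(0)}(\cdot, z_1, z_2)\right)$ where $F^{(0)}$ is given by Lemma 4.4. Lemma 4.4 also implies that there exists a constant $C$ such that

$$|H_\ell(x, z_1, z_2)| \le C\, |f|_{V^\alpha} \left( V^\alpha(z_1) + V^\alpha(z_2) \right) P_{\theta_\star}^\ell V^\alpha(x) . \tag{35}$$

By I3, the variance of $P_{\theta_\star}^\ell G_{\theta_k}^{f}(x)$ is upper bounded by

$$C\, |f|^2_{V^\alpha} \left( P_{\theta_\star}^\ell V^\alpha(x) \right)^2 k^{-2} .$$

The proof is concluded by application of the drift inequality (14) and I3-a. □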

5. Appendix. 

5.1. Technical lemmas. The following lemma is (slightly) adapted from 
(Fort et al., 2010a, Lemma 4.2.) 

Lemma 5.1. Assume A2. For any f £ Cy a and 6,6' £ 0, 

IK, - vr , || Vo < 2{L V V L e )\ e {V a ) D v « (6, 6') , 
||A e -A<HI^<3 (L e VL e ,fTr e (V a )D Va (6,6') 

\V 

where $L_\theta$ and $\Lambda_\theta$ are given by (1) and (4).



PgAg-Pg,Ag,\\y a < 5 {Lg V Lg,f ng{V a )D V <*{6, &') . 



The following lemma can be obtained from Roberts and Rosenthal (2004), 
Fort and Moulines (2003), Douc et al. (2004) or Baxendale (2005) (see also 
the proof of (Saksman and Vihola, 2010, Lemma 3) for a similar result). 
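Lemma 5.2. Let $\{P_\theta, \theta \in \Theta\}$ be a family of phi-irreducible and aperiodic Markov kernels. Assume that there exist a function $V : X \to [1, +\infty)$, and for any $\theta \in \Theta$ there exist some constants $b_\theta < \infty$, $\delta_\theta \in (0,1)$, $\lambda_\theta \in (0,1)$ and a probability measure $\nu_\theta$ on $X$ such that

$$P_\theta V \le \lambda_\theta V + b_\theta ,$$

$$P_\theta(x, \cdot) \ge \delta_\theta\, \nu_\theta(\cdot)\, \mathbb{1}_{\{V \le c_\theta\}}(x) , \qquad c_\theta \stackrel{\mathrm{def}}{=} 2 b_\theta (1 - \lambda_\theta)^{-1} - 1 .$$

Then there exists $\gamma > 0$ and for any $\theta$, there exist some finite constants $C_\theta$ and $\rho_\theta \in (0, 1)$ such that

$$\| P_\theta^n(x, \cdot) - \pi_\theta \|_V \le C_\theta\, \rho_\theta^n\, V(x)$$

and

$$C_\theta \vee (1 - \rho_\theta)^{-1} \le C \left\{ b_\theta \vee \delta_\theta^{-1} \vee (1 - \lambda_\theta)^{-1} \right\}^\gamma .$$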
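To make the shape of the bound $\| P_\theta^n(x, \cdot) - \pi_\theta \|_V \le C_\theta \rho_\theta^n V(x)$ concrete, here is a toy sketch on a two-state chain with $V \equiv 1$, for which the distance to stationarity is known in closed form. The transition probabilities `a` and `b` are hypothetical, and the constants `C` and `rho` below are those of this toy chain, not the constants produced by the lemma.

```python
def two_state_kernel(a, b):
    """Transition matrix of a two-state chain: 0 -> 1 w.p. a, 1 -> 0 w.p. b."""
    return [[1.0 - a, a], [b, 1.0 - b]]


def step(dist, kernel):
    """One step of the chain acting on a row vector of probabilities."""
    return [sum(dist[i] * kernel[i][j] for i in range(2)) for j in range(2)]


def distance_to_stationary(a, b, x, n):
    """Return sum_y |P^n(x, y) - pi(y)|, i.e. the V-norm distance with V = 1."""
    kernel = two_state_kernel(a, b)
    stationary = [b / (a + b), a / (a + b)]
    dist = [1.0 if i == x else 0.0 for i in range(2)]
    for _ in range(n):
        dist = step(dist, kernel)
    return sum(abs(d - p) for d, p in zip(dist, stationary))


a, b = 0.3, 0.1                    # hypothetical transition probabilities
rho = abs(1.0 - a - b)             # second eigenvalue of the kernel
C = 2.0 * max(a, b) / (a + b)      # uniform constant over both starting states
horizons = list(range(0, 30, 5))
from_state_0 = [distance_to_stationary(a, b, 0, n) for n in horizons]
from_state_1 = [distance_to_stationary(a, b, 1, n) for n in horizons]
```

In this toy case the distance started from state 0 equals $2a/(a+b)\, \rho^n$ exactly, so the geometric rate is sharp.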

Lemma 5.3 is proved in (Fort et al., 2010b, Section 4). 

Lemma 5.3. Let $X$ be a Polish space endowed with its Borel $\sigma$-field $\mathcal{X}$. Let $\mu$ and $(\mu_n)_{n \in \mathbb{N}}$ be probability distributions on $(X, \mathcal{X})$. Let $(h_n)_{n \in \mathbb{N}}$ be an equicontinuous family of functions from $X$ to $\mathbb{R}$. Assume

(i) the sequence $(\mu_n)_{n \in \mathbb{N}}$ converges weakly to $\mu$,
(ii) for any $x \in X$, $\lim_{n \to \infty} h_n(x)$ exists, and there exists $\gamma > 1$ such that $\sup_n \mu_n(|h_n|^\gamma) + \mu(|\lim_n h_n|) < +\infty$.

Then, $\mu_n(h_n) \to \mu(\lim_n h_n)$.

5.2. Weak law of large numbers for adaptive and interacting MCMC algorithms. The proof of the theorem below is along the same lines as the proof of (Fort et al., 2010a, Theorem 2.7), which addresses the strong law of large numbers, and details are omitted. Note that in this generalization, we relax the condition $\sup_\theta |F(\cdot, \theta)|_V < +\infty$ of Fort et al. (2010a). The proof is provided in the supplementary paper (Fort et al., 2011).

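Theorem 5.4. Assume A1, A2 and let $\gamma \in (0, 1)$. Assume that

(i) $\limsup_{n \to \infty} L_{\theta_n} < \infty$, $\mathbb{P}$-a.s., where $L_\theta$ is defined in Lemma 2.1 applied with the closed interval $[\gamma, 1]$.
(ii) $\limsup_{n \to \infty} \pi_{\theta_n}(V^\gamma) < \infty$, $\mathbb{P}$-a.s.
(iii) $\sup_{k \ge 1} \mathbb{E}[V(X_k)] < \infty$.
(iv) $n^{-1} \sum_{k=1}^{n} D_{V^\gamma}(\theta_k, \theta_{k-1})\, V^\gamma(X_k) \stackrel{P}{\longrightarrow} 0$.

Let $F : X \times \Theta \to \mathbb{R}$ be a measurable function s.t.

(v) $\limsup_{n \to \infty} |F_{\theta_n}|_{V^\gamma} < +\infty$.
(vi) $n^{-1} \sum_{k=1}^{n} |F_{\theta_k} - F_{\theta_{k-1}}|_{V^\gamma}\, V^\gamma(X_k) \stackrel{P}{\longrightarrow} 0$.

Then,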
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